AITopics | task specification

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.41)
Information Technology > Artificial Intelligence > Machine Learning (0.41)

Neural Information Processing SystemsFeb-12-2026, 00:47:18 GMT

439539557e9ba0d04055773ff1f3241c-Paper-Datasets_and_Benchmarks_Track.pdf

large language model, machine learning, natural language, (19 more...)

Country: North America > United States > Iowa (0.04)

Genre: Research Report (0.93)

Industry: Law (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Cognitive Science (0.93)

arXiv.org Artificial IntelligenceDec-8-2025

Correspondence-Oriented Imitation Learning: Flexible Visuomotor Control with 3D Conditioning

Cao, Yunhao, Bhaumik, Zubin, Jia, Jessie, He, Xingyi, Fang, Kuan

We introduce Correspondence-Oriented Imitation Learning (COIL), a conditional policy learning framework for visuomotor control with a flexible task representation in 3D. At the core of our approach, each task is defined by the intended motion of keypoints selected on objects in the scene. Instead of assuming a fixed number of keypoints or uniformly spaced time intervals, COIL supports task specifications with variable spatial and temporal granularity, adapting to different user intents and task requirements. To robustly ground this correspondence-oriented task representation into actions, we design a conditional policy with a spatio-temporal attention mechanism that effectively fuses information across multiple input modalities. The policy is trained via a scalable self-supervised pipeline using demonstrations collected in simulation, with correspondence labels automatically generated in hindsight. COIL generalizes across tasks, objects, and motion patterns, achieving superior performance compared to prior methods on real-world manipulation tasks under both sparse and dense specifications.

large language model, machine learning, specification, (15 more...)

2512.05953

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Neural Information Processing SystemsNov-20-2025, 14:31:37 GMT

Bayesian Inference of Temporal Task Specifications from Demonstrations

Ankit Shah, Pritish Kamath, Julie A. Shah, Shen Li

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, specification, (19 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.83)

arXiv.org Artificial IntelligenceOct-29-2025

Logic-based Task Representation and Reward Shaping in Multiagent Reinforcement Learning

Doshi, Nishant

This paper presents an approach for accelerated learning of optimal plans for a given task represented using Linear Temporal Logic (LTL) in multi-agent systems. Given a set of options (temporally abstract actions) available to each agent, we convert the task specification into the corresponding Buchi Automaton and proceed with a model-free approach which collects transition samples and constructs a product Semi Markov Decision Process (SMDP) on-the-fly. Value-based Reinforcement Learning algorithms can then be used to synthesize a correct-by-design controller without learning the underlying transition model of the multi-agent system. The exponential sample complexity due to multiple agents is dealt with using a novel reward shaping approach. We test the proposed algorithm in a deterministic gridworld simulation for different tasks and find that the reward shaping results in significant reduction in convergence times. We also infer that using options becomes increasing more relevant as the state and action space increases in multi-agent systems.

agent, artificial intelligence, machine learning, (16 more...)

2510.23615

Country:

North America > United States > Massachusetts > Worcester County > Worcester (0.04)
Asia > Middle East > Republic of Türkiye > Aksaray Province > Aksaray (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

arXiv.org Artificial IntelligenceOct-20-2025

Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information

Corazza, Jan, Aria, Hadi Partovi, Kim, Hyohun, Neider, Daniel, Xu, Zhe

Reinforcement learning (RL) algorithms can find an optimal policy for a single agent to accomplish a particular task. However, many real-world problems require multiple agents to collaborate in order to achieve a common goal. For example, a robot executing a task in a warehouse may require the assistance of a drone to retrieve items from high shelves. In Decentralized Multi-Agent RL (DMARL), agents learn independently and then combine their policies at execution time, but often must satisfy constraints on compatibility of local policies to ensure that they can achieve the global task when combined. In this paper, we study how providing high-level symbolic knowledge to agents can help address unique challenges of this setting, such as privacy constraints, communication limitations, and performance concerns. In particular, we extend the formal tools used to check the compatibility of local policies with the team task, making decentralized training with theoretical guarantees usable in more scenarios. Furthermore, we empirically demonstrate that symbolic knowledge about the temporal evolution of events in the environment can significantly expedite the learning process in DMARL.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

doi: 10.1007/978-3-032-06106-5_5

2506.07829

Country: North America > United States (0.67)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Neural Information Processing SystemsOct-10-2025, 00:44:38 GMT

ClevrSkills: Compositional Language and Visual Reasoning in Robotics

Robotics tasks are highly compositional by nature.

criteria, dataset, obj, (15 more...)

Country: North America > United States > Iowa (0.04)

Genre: Research Report (0.93)

Industry: Law (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)

Nandan, Aditey, Kumar, Viraj

Ambiguity Resolution with Human Feedback for Code Writing Tasks

arXiv.org Artificial IntelligenceAug-21-2025

Specifications for code writing tasks are usually expressed in natural language and may be ambiguous. Programmers must therefore develop the ability to recognize ambiguities in task specifications and resolve them by asking clarifying questions. We present and evaluate a prototype system, based on a novel technique (ARHF: Ambiguity Resolution with Human Feedback), that (1) suggests specific inputs on which a given task specification may be ambiguous, (2) seeks limited human feedback about the code's desired behavior on those inputs, and (3) uses this feedback to generate code that resolves these ambiguities. We evaluate the efficacy of our prototype, and we discuss the implications of such assistive systems on Computer Science education.

large language model, machine learning, natural language, (18 more...)

2508.14114

Country: Asia > India (0.05)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)

Ajeleye, Daniel, Trivedi, Ashutosh, Zamani, Majid

Physics-Informed Reward Machines

arXiv.org Artificial IntelligenceAug-21-2025

Reward machines (RMs) provide a structured way to specify non-Markovian rewards in reinforcement learning (RL), thereby improving both expressiveness and programmability. Viewed more broadly, they separate what is known about the environment, captured by the reward mechanism, from what remains unknown and must be discovered through sampling. This separation supports techniques such as counterfactual experience generation and reward shaping, which reduce sample complexity and speed up learning. We introduce physics-informed reward machines (pRMs), a symbolic machine designed to express complex learning objectives and reward structures for RL agents, thereby enabling more programmable, expressive, and efficient learning. We present RL algorithms capable of exploiting pRMs via counterfactual experiences and reward shaping. Our experimental results show that these techniques accelerate reward acquisition during the training phases of RL. We demonstrate the expressiveness and effectiveness of pRMs through experiments in both finite and continuous physical environments, illustrating that incorporating pRMs significantly improves learning efficiency across several control tasks.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

2508.14093

Country:

North America > United States (0.46)
Oceania (0.28)

Genre: Research Report > New Finding (0.48)

Industry:

Energy (0.67)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Artificial IntelligenceAug-21-2025

Accelerating Signal-Temporal-Logic-Based Task and Motion Planning of Bipedal Navigation using Benders Decomposition

Ren, Jiming, Lin, Xuan, Mineyev, Roman, Feigh, Karen M., Coogan, Samuel, Zhao, Ye

--T ask and motion planning under Signal T emporal Logic constraints is known to be NP-hard. A common class of approaches formulates these hybrid problems, which involve discrete task scheduling and continuous motion planning, as mixed-integer programs (MIP). However, in applications for bipedal locomotion, introduction of non-convex constraints such as kinematic reachability and footstep rotation exacerbates the computational complexity of MIPs. In this work, we present a method based on Benders Decomposition to address scenarios where solving the entire monolithic optimization problem is prohibitively intractable. Benders Decomposition proposes an iterative cutting-plane technique that partitions the problem into a master problem to prototype a plan that meets the task specification, and a series of subproblems for kinematics and dynamics feasibility checks. Our experiments demonstrate that this method achieves faster planning compared to alternative algorithms for solving the resulting optimization program with nonlinear constraints. A project website can be found at http: //bipedal-stl.github.io/. Note to Practitioners -- Bipedal robots are increasingly demanded in warehouses and factories for complex automation tasks such as stacking, delivering, and interacting with other robots under strict time and safety constraints. However, planning such operations under formal language instructions such as Signal T emporal Logic (STL) specifications often results in large-scale mixed-integer programs that are impractical to be solved in a timely manner . This paper introduces an accelerated task and motion planning (T AMP) approach via Benders Decomposition that splits the task into a high-level scheduling problem and lower-level motion feasibility checks, allowing practitioners to find feasible and optimal task and motion plans far more efficiently. Compared to conventional monolithic solvers or alternative decomposition methods, our approach can generate solutions more than twenty times faster while rigorously satisfying kinematic and dynamic constraints. Benchmark scenarios, including factory delivery and warehouse logistics, demonstrate how our method handles realistic automation scenarios involving long planning horizons and complicated task specifications.

artificial intelligence, constraint, formulation, (17 more...)

2508.13407

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry: Transportation (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)